NSF PAR Search | NSF Public Access Repository

Cooperation and Competition of Base Pairing and Electrostatic Interactions in Mixtures of DNA Nanostars and Polylysine

https://doi.org/10.1021/jacs.5c12436

Abraham, Gabrielle R; Li, Tianhao; Nguyen, Anna; Jacobs, William M; Saleh, Omar A (November 2025, Journal of the American Chemical Society)

Free, publicly-accessible full text available November 10, 2026
Mapping from Meaning: Addressing the Miscalibration of Prompt-Sensitive Language Models

https://doi.org/10.1609/aaai.v39i22.34540

Cox, Kyle; Xu, Jiawei; Han, Yikun; Xu, Rong; Li, Tianhao; Hsu, Chi-Yang; Chen, Tianlong; Gerych, Walter; Ding, Ying (April 2025, Proceedings of the AAAI Conference on Artificial Intelligence)

An interesting behavior in large language models (LLMs) is prompt sensitivity. When provided with different but semantically equivalent versions of the same prompt, models may produce very different distributions of answers. This suggests that the uncertainty reflected in a model's output distribution for one prompt may not reflect the model's uncertainty about the meaning of the prompt. We model prompt sensitivity as a type of generalization error, and show that sampling across the semantic concept space with paraphrasing perturbations improves uncertainty calibration without compromising accuracy. Additionally, we introduce a new metric for uncertainty decomposition in black-box LLMs that improves upon entropy-based decomposition by modeling semantic continuities in natural language generation. We show that this decomposition metric can be used to quantify how much LLM uncertainty is attributed to prompt sensitivity. Our work introduces a new way to improve uncertainty calibration in prompt-sensitive language models, and provides evidence that some LLMs fail to exhibit consistent general reasoning about the meanings of their inputs.
Free, publicly-accessible full text available April 11, 2026
CancerGPT for few shot drug pair synergy prediction using large pretrained language models

https://doi.org/10.1038/s41746-024-01024-9

Li, Tianhao; Shetty, Sandesh; Kamath, Advaith; Jaiswal, Ajay; Jiang, Xiaoqian; Ding, Ying; Kim, Yejin (December 2024, npj Digital Medicine)

Large language models (LLMs) have been shown to have significant potential in few-shot learning across various fields, even with minimal training data. However, their ability to generalize to unseen tasks in more complex fields, such as biology and medicine, has yet to be fully evaluated. LLMs can offer a promising alternative approach for biological inference, particularly in cases where structured data and sample size are limited, by extracting prior knowledge from text corpora. Here we report our proposed few-shot learning approach, which uses LLMs to predict the synergy of drug pairs in rare tissues that lack structured data and features. Our experiments, which involved seven rare tissues from different cancer types, demonstrate that the LLM-based prediction model achieves significant accuracy with very few or zero samples. Our proposed model, CancerGPT (with ~124M parameters), is comparable to the larger fine-tuned GPT-3 model (with ~175B parameters). Our research contributes to tackling drug pair synergy prediction in rare tissues with limited data, and also to advancing the use of LLMs for biological and medical inference tasks.
Full Text Available
Predicting the Morphology of Multiphase Biomolecular Condensates from Protein Interaction Networks

https://doi.org/10.1103/PRXLife.2.023013

Li, Tianhao; Jacobs, William M (June 2024, PRX Life)

Phase-separated biomolecular condensates containing proteins and RNAs can assemble into higher-order structures by forming thermodynamically stable interfaces between immiscible phases. Using a minimal model of a protein/RNA interaction network, we demonstrate how a “shared” protein species that partitions into both phases of a multiphase condensate can function as a tunable surfactant that modulates the interfacial properties. We use Monte Carlo simulations and free-energy calculations to identify conditions under which a low concentration of this shared species is sufficient to trigger a wetting transition. We also describe a numerical approach based on classical density functional theory to predict concentration profiles and surface tensions directly from the model protein/RNA interaction network. Finally, we show that the wetting phase diagrams that emerge from our calculations can be understood in terms of a simple model of selective adsorption to a fluctuating interface. Our work shows how a low-concentration protein species might function as a biological switch for regulating multiphase condensate morphologies. Published by the American Physical Society, 2024.
Full Text Available
GuideLLM: Exploring LLM-Guided Conversation with Applications in Autobiography Interviewing

https://doi.org/10.18653/v1/2025.naacl-long.287

Duan, Jinhao; Zhao, Xinyu; Zhang, Zhuoxuan; Ko, Eunhye Grace; Boddy, Lily; Wang, Chenan; Li, Tianhao; Rasgon, Alexander; Hong, Junyuan; Lee, Min Kyung; et al (January 2025, Association for Computational Linguistics)

Full Text Available
Competition between Self-Assembly and Phase Separation Governs High-Temperature Condensation of a DNA Liquid

https://doi.org/10.1103/PhysRevLett.132.208401

Hegde, Omkar; Li, Tianhao; Sharma, Anjali; Borja, Marco; Jacobs, William M; Rogers, W Benjamin (May 2024, Physical Review Letters)
Interplay between self-assembly and phase separation in a polymer-complex model

https://doi.org/10.1103/PhysRevE.108.064501

Li, Tianhao; Rogers, W. Benjamin; Jacobs, William M. (December 2023, Physical Review E)

Full Text Available
Improving model fairness in image-based computer-aided diagnosis

https://doi.org/10.1038/s41467-023-41974-4

Lin, Mingquan; Li, Tianhao; Yang, Yifan; Holste, Gregory; Ding, Ying; Van Tassel, Sarah H.; Kovacs, Kyle; Shih, George; Wang, Zhangyang; Lu, Zhiyong; et al (December 2023, Nature Communications)

Deep learning has become a popular tool for computer-aided diagnosis using medical images, sometimes matching or exceeding the performance of clinicians. However, these models can also reflect and amplify human bias, potentially resulting in inaccurate or missed diagnoses. Despite this concern, the problem of improving model fairness in medical image classification by deep learning has yet to be fully studied. To address this issue, we propose an algorithm that leverages the marginal pairwise equal opportunity to reduce bias in medical image classification. Our evaluations across four tasks using four independent large-scale cohorts demonstrate that our proposed algorithm not only improves fairness in individual and intersectional subgroups but also maintains overall performance. Specifically, the relative change in pairwise fairness difference between our proposed model and the baseline model was reduced by over 35%, while the relative change in AUC value was typically within 1%. By reducing the bias generated by deep learning models, our proposed approach can potentially alleviate concerns about the fairness and reliability of image-based computer-aided diagnosis.
Full Text Available
EyeQoE: A Novel QoE Assessment Model for 360-degree Videos Using Ocular Behaviors

https://doi.org/10.1145/3517240

Zhu, Huadi; Li, Tianhao; Wang, Chaowei; Jin, Wenqiang; Murali, Srinivasan; Xiao, Mingyan; Ye, Dongqing; Li, Ming (March 2022, Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies)

Because virtual reality (VR) offers an experience unlike that of any existing multimedia technology, VR videos, also called 360-degree videos, have attracted considerable attention from academia and industry. Quantifying and modeling end users' perceived quality when watching 360-degree videos, known as quality of experience (QoE), is central to high-quality provisioning of these multimedia services. In this work, we present EyeQoE, a novel QoE assessment model for 360-degree videos using ocular behaviors. Unlike prior approaches, which mostly rely on objective factors, EyeQoE leverages the new ocular sensing modality to comprehensively capture both subjective and objective impact factors for QoE modeling. We propose a novel method that models eye-based cues as graphs and develop a GCN-based classifier to produce QoE assessments by extracting intrinsic features from graph-structured data. We further exploit the Siamese network to eliminate the impact of heterogeneity across subjects and visual stimuli. A domain adaptation scheme named MADA is also devised to generalize our model to a vast range of unseen 360-degree videos. Extensive tests are carried out with our collected dataset. Results show that EyeQoE achieves a best prediction accuracy of 92.9%, outperforming state-of-the-art approaches. As another contribution of this work, we have made our dataset publicly available at https://github.com/MobiSec-CSE-UTA/EyeQoE_Dataset.git.
Full Text Available
SCALP - Supervised Contrastive Learning for Cardiopulmonary Disease Classification and Localization in Chest X-rays using Patient Metadata

https://doi.org/10.1109/icdm51629.2021.00134

Jaiswal, Ajay; Li, Tianhao; Zander, Cyprian; Han, Yan; Rousseau, Justin F.; Peng, Yifan; Ding, Ying (December 2021, IEEE Conference Proceedings)

Full Text Available
